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SEQUENCE LISTING 



<110> KAWAOKA, Akiyoshi 
EBINUMA, Hiroyasu 

<12 0> TRANSCRIPTION FACTOR CONTROLLING PHENYLPROPANOID 
BIOSYNTHESIS PATHWAY 

<130> 4859-0027-0 

<140> 09/282,146 
<141> 1999-03-31 

<150> JP 10-125171 
<151> 1998-03-31 

<16d| 13 

s 55" 

<170> Patentin Ver. 2.1 

.3 .a. 

<2l6t 1 
<2liU' 988 
<212> DNA 

<213> Nicotiana tabacum 

<22m> 

<22^ CDS 

<222=> (100) . . (702) 

a? 

<22l'd> 

<221> misc_f eature 
<222> (127) . . (282) 
<223> LIM domain 

<220> 

<221> misc_f eature 
<222> (427) . . (582) 
<223> LIM domain 

<400> 1 

gaattcgcgg ccgttccaaa aaccaagtgc taacacaaag aaagggaaag agccacaaag 60 

accatttttg ttttctgtaa aacttgctcg tatatagcc atg get ttt gca gga 11 

Met Ala Phe Ala Gly 
1 5 

acc aca cag aaa tgc atg gca tgt gac aag act gtc tat ctg gtt gac 16 
Thr Thr Gin Lys Cys Met Ala Cys Asp Lys Thr Val Tyr Leu Val Asp 



-1- 



10 15 20 

aaa tta act gca gat aac aga ate tat cac aaa get tgt ttc aga tgc 210 

Lys Leu Thr Ala Asp Asn Arg lie Tyr His Lys Ala Cys Phe Arg Cys 

25 30 35 



cat cac tgc aag ggc act gtc aag ctt ggc aac tac aat tec ttt gag 

His His Cys Lys Gly Thr Val Lys Leu Gly Asn Tyr Asn Ser Phe Glu 
40 45 50 

gga gtt eta tac tgt aga cea cac ttt gat cag etc ttc aaa caa act 

Gly Val Leu Tyr Cys Arg Pro His Phe Asp Gin Leu Phe Lys Gin Thr 
55 60 65 



tgc t&: caa tgc age cac gga ggc tgt gta ata age ect tec aac tat 
Cys E^rife Gin Cys Ser His Gly Gly Cys Val lie Ser Pro Ser Asn Tyr 
135 140 145 



aat tee aeg aca aca aca gga gtt act gca gag tea tac aca gee gae 
Asn Ser Thr Thr Thr Thr Gly Val Thr Ala Glu Ser Tyr Thr Ala Asp 
185 190 195 



258 



306 



ggc agt ttg gat aaa age ttt gaa ggt aca cea aaa aat gtg aag cea 3 54 

Gly Ser Leu Asp Lys Ser Phe Glu Gly Thr Pro Lys Asn Val Lys Pro 
70 75 80 85 

cag a^Sa cec att gae agt gag aaa cea cag gta gee aaa gtg aca age 4 02 

Gin Ly^ Pro lie Asp Ser Glu Lys Pro Gin Val Ala Lys Val Thr Ser 

^3 90 95 100 

m 

atg ftt3: ggt gga aca aga gag aaa tgt ttt ggc tgc aag aaa act gtc 

Met PiiTe Gly Gly Thr Arg Glu Lys Cys Phe Gly Cys Lys Lys Thr Val 
105 110 115 

tac cea aca gaa aag gta tea gee aat ggc aeg cea tac cat aag age 4 98 

Tyr Pigo Thr Glu Lys Val Ser Ala Asn Gly Thr Pro Tyr His Lys Ser 
il 120 125 130 



450 



546 



ace gca cat gag ggg cge tta tat tgt aaa eat cac cat att caa ctt 5 94 
Thr Ala His Glu Gly Arg Leu Tyr Cys Lys His His His lie Gin Leu 
150 155 160 165 

ate aag gag aag ggc aac tta age aag ctt gag ggt gae cat gaa atg 642 
lie Lys Glu Lys Gly Asn Leu Ser Lys Leu Glu Gly Asp His Glu Met 

170 175 180 



690 



caa gtt gat tga teettatett taccgegate atgtattaeg tatctgetgt 742 
Gin Val Asp 
200 
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# # 



tagttgtaag aatcgaaggc gttcagcagc ttccatgaat gcacttgcct tgccccagcg 802 

tatgttttac tctaatctag cttcaattaa tttgatgttg aactatatat tgtctagctt 862 

ttgtgtgtag atttttgacc tttgtttgct tgtgcttcac ttgtattatg tgaatgttga 922 

atgagattga atataacatg gttttgctgt cccagtgcat gcaaatcttt gagcggccgc 982 
gaattc 



<210> 2 
<211> 200 
<212> PRT 

<213> Nicotiana tabacum 
<400> 2 

Meti'^la Phe Ala Gly Thr Thr Gin Lys Cys Met Ala Cys Asp Lys Thr 

l3 5 10 15 

Valuiryr Leu Val Asp Lys Leu Thr Ala Asp Asn Arg lie Tyr His Lys 

fij 20 25 30 

AlaCOCys Phe Arg Cys His His Cys Lys Gly Thr Val Lys Leu Gly Asn 

35 40 45 

Tyit"Asn Ser Phe Glu Gly Val Leu Tyr Cys Arg Pro His Phe Asp Gin 

rU 50 55 60 

Le^|=.Phe Lys Gin Thr Gly Ser Leu Asp Lys Ser Phe Glu Gly Thr Pro 
eii 70 75 80 

Lys^Asn Val Lys Pro Gin Lys Pro lie Asp Ser Glu Lys Pro Gin Val 

''t 85 90 95 

Alp Lys Val Thr Ser Met Phe Gly Gly Thr Arg Glu Lys Cys Phe Gly 

lI 100 105 110 

Cys Lys Lys Thr Val Tyr Pro Thr Glu Lys Val Ser Ala Asn Gly Thr 

115 120 125 

Pro Tyr His Lys Ser Cys Phe Gin Cys Ser His Gly Gly Cys Val lie 

130 135 140 

Ser Pro Ser Asn Tyr Thr Ala His Glu Gly Arg Leu Tyr Cys Lys His 
145 150 155 160 

His His lie Gin Leu lie Lys Glu Lys Gly Asn Leu Ser Lys Leu Glu 

165 170 175 

Gly Asp His Glu Met Asn Ser Thr Thr Thr Thr Gly Val Thr Ala Glu 

180 185 190 

Ser Tyr Thr Ala Asp Gin Val Asp 
195 200 



<210> 3 
<211> 17 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Synthetic DNA 
<400> 3 

tgccaccaaa ctcctac 



<210> 4 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Synthetic DNA 



<22:@> 

<22l> Description of Artificial Sequence : Synthetic DNA 



ctfefeaccaac ccccatc 



<210> 6 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Synthetic DNA 
<400> 6 

ctccaacaaa ccccttc 



<210> 7 
<211> 17 
<212> DNA 

<213> Artificial Sequence 



<4Q0;> 4 



ctcgaccaac ccccttc 



17 




<4q§> 5 
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: 4 i 

V 

<220> 

<223> Description of Artificial Sequence : Synthetic DNA 
<400> 7 

tctcaacaac tcctcct 



<210> 8 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Synthetic DNA 



<400> 8 

tgccaactga cccgtag 

<2iti> 9 

<2l£> 17 
<2lJ^|> DNA 

<2i;|> Artificial Sequence 
<22l> 

<22G> Description of Artificial Sequence : Synthetic DNA 
<40f> 9 

acdqaactaa ccccggc 



<2i0> 10 
<211> 17 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence : Synthetic DNA 
<400> 10 

atccaacaaa taacaca 



<210> 11 

<211> 17 

<212> DNA 

<213> Artificial 

<220> 



Sequence 
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4 




<223> Description of Artificial Sequence : Synthetic DNA 
<400> 11 

caccacttga gtacaaa 



<210> 12 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Synthetic DNA 

<400> 12 
ccaacaaacc cc 



<2W> 13 
<2ia.> 12 
<2l§> DNA 



<2i3> Artificial Sequence 



<2i^p> 



<2iJ> Description of Artificial Sequence : Synthetic DNA 



<4.ey)> 13 
cc|8:ttgagt ac 



12 
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SEQUENCE LENGTH: 200 
SEQUENCE TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 



SEQUENCE 

Met Ala Phe Ala Gly Thr Thr Gin 

1 5 

Val Tyr Leu Val Asp Lys Leid'^hr 
20 

Ala Cys Phe Arg Cy's His t^is Cy 

35 / 40 

Tyr Asn Ser Phe Glu Gly Val /^Le 

50 55 

Leu Phe Lys Gin Thr Gly Ser 

65 70 I / 

Lys lie Val Lys Pro Gin i?ys Pro 
85 

Ala Lys Val Thr Ser pW Gly 
100 

Cys Lys Lys Thr Val/ Tyr Pro Thr 

125 / 120 
Pro Tyr His Lys ^er Cys Phe Gin 

130 / 135 

Ser Pro Ser A^n Tyr Thr Ala His 

145 / 150 

His His Il^Gln Leu lie Lys Glu 
165 

Gly Asp ^is Glu Met Asn Ser Thr 
180 

Ser Ty/ Thr Ala Asp Gin Val Asp 

195 200 



Lys Cys H^t 
10, 

Ala A^ Asn 

25 

L^ Gly Thr 

'Tyr Cys Arg 

Asj^ Lys Ser 
75 

lie Asp Ser 
90 

Gly Thr Arg 
105 

Glu Lys Val 

Cys Ser His 

Glu Gly Arg 
155 

Lys Gly Asn 
170 

Thr Thr Thr 
185 



Val Lys 
45 

Pro His 

60 
Phe Glu 

Glu Lys 

Glu Lys 

Ser Ala 
125 
Gly Gly 
140 

Leu Tyr 
Leu Ser 
Glu Val 



Asp 


Lys 


Thr 


16 




15 






Tyr 


His 


Lys 


32 










Leu 


Ljiy 


Asn 


A Q 


r-\ u, ^ 

Pne 


Asp 


Gin 




Gly 


Thr 


Pro 


80 






80 




Pro 


Gin 


Val 


96 




95 






Cys 


Phe 


Gly 


112 


110 








Asn 


Gly 


Thr 


128 


Cys 


Val 


He 


144 


Cys 


Lys 


His 


160 






160 




Lys 


Leu 


Glu 


176 




175 






Thr 


Ala 


Glu 


192 


190 









200 
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SEQUENCE LISTING 



'•I -J 

m 



SEQ ID NO: 1 
SEQUENCE LENGTH: 988 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 
MOLECULE TYPE: cDNA to mRNA 
ORIGINAL SOURCE 

ORGANISM: Nicotiana tobacum 

Tissue tepy: leaf 
FEATURE 

NAME/KEY: CDS 

LOCATION: 100. .702 

IDENTIFICATION METHOD: E 

NAME /KEY: domain 
LOCATION: 127 . . 282 
IDENTIFICATION METHOD: S 
ANOTHER INFORMATION: /?M:M 
mot i f 

NAME/KEY: domain 
LOCATION: 4 27.. 582 
IDENTIFICATION METHOEj 
ANOTH ER I NFORMAT I ON : 
moti f 




kind 



zing finger 



main which is a kind of the zing finger 



SEQUENCE 

GAATTCGCGG CCGTTCC 
ACCATTTTTG TTTTCTG 
ACC ACA CAG AAA T^ 
AAA TTA ACT GCA 
CAT CAC TGC 
GGA GTT CTA 
GGC ACT TTG 
CAG AAA CCC 
ATG TTT GG 
TAC CCA A 
TGC TTC 



AACCAAGTGC TAACACAAAG AAAGGGAAAG AGCCACAAAG 
fpj\ AACTTGCTCG TATATAGCC ATG GCT TTT GCA GGA 
ATG GCA TGT GAC AAG ACT GTC TAT CTG GTT GAC 
AT AAC AGA ATC TAT CAC AAA GCT TGT TTC AGA TGC 
GGC ACT GTC AAG CTT GGC AAC TAC AAT TCC TTT GAG 
TGT AGA CCA CAC TTT GAT CAG CTC TTC AAA CAA ACT 
AAA AGC TTT GAA GGT ACA CCA AAA ATT GTG AAG CCA 
,TT GAC AGT GAG AAA CCA CAG GTA GCC AAA GTG ACA AGC 
GGA ACA AGA GAG AAA TGT TTT GGC TGC AAG AAA ACT GTC 
GAA AAG GTA TCA GCC AAT GGC ACG CCA TAC CAT AAG AGC 
TGC AGC CAC GGA GGC TGT GTA ATA AGC CCT TCC AAC TAT 



ACC GCA /CAT GAG GGG CGC TTA TAT TGT AAA CAT CAC CAT ATT CAA CTT 
ATC AAg/ gag AAG GGC AAC TTA AGC AAG CTT GAG GGT GAC CAT GAA ATG 
T9C ACG ACA ACA ACA GAA GTT ACT GCA GAG TCA TAC ACA GCC GAC 

TT GAT TGA TCCTTATCTT TACCGCGATC ATGTATTACG TATCTGCTGT 
TGTAAG AATCGAAGGC GTTCAGCAGC TTCCATGAAT GCACTTGCCT TGCCCCAGCG 
jTTTTAC TCTAATCTAG CTTCAATTAA TTTGATGTTG AACTATATAT TGTCTAGCTT 
TTyfeTGTGTAG ATTTTTGACC TTTGTTTGCT TGTGCTTCAC TTGTATTATG TGAATGTTGA 
a/gagattga ATATAACATG GTTTTGCTGT CCCAGTGCAT GCAAATCTTT GAGCGGCCGC 
5AATTC 

SEQ ID NO: 2 
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60 
114 
162 
210 
258 
306 
354 
402 
450 
498 
546 
594 
642 
690 
742 
802 
862 
922 
932 
988 



